SYDNEY CMCRC at TAC 2013

نویسندگان

  • Glen Pink
  • Andrew Naoum
  • Will Radford
  • Will Cannings
  • Joel Nothman
  • Daniel Tse
  • James R. Curran
چکیده

We use a supervised whole-document approach to English Entity Linking with simple clustering approaches. The system extends our TAC 2012 system (Radford et al., 2012), introducing new features for modelling local entity description and type-specific matching as well type-specific supervised models and supervised NIL classification. Our rule-based clustering takes advantage of local description and topics to split NIL clusters. The best system uses supervised entity linking and local description type clustering and scores 72.7% B+ F1 score. Our KB clustering score is competitive with the top system at 71.4%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

(Almost) Total Recall - SYDNEY CMCRC at TAC 2012

We explore unsupervised and supervised whole-document approaches to English NEL with naı̈ve and context clustering. Our best system uses unsupervised entity linking and naı̈ve clustering and scores 66.5% B+ F1 score. Our KB clustering score is competitive with the top systems at 65.6%.

متن کامل

Naïve but effective NIL clustering baselines - CMCRC at TAC 2011

This paper describes the CMCRC systems entered in the TAC 2011 entity linking challenge. We used our best-performing system from TAC 2010 to link queries, then clustered NIL links. We focused on naı̈ve baselines that group by attributes of the top entity candidate. All three systems performed strongly at 75.4% B F1, above the 71.6% median score.

متن کامل

Document-level Entity Linking: CMCRC at TAC 2010

This paper describes the CMCRC systems entered in the TAC 2010 entity linking challenge. The best performing system we describe implements the document-level entity linking system from Cucerzan (2007), with several additions that exploit global information. Our implementation of Cucerzan’s method achieved a score of 74.9% in development experiments. Additional global information improves perfor...

متن کامل

Basis Technology at TAC 2013 Entity Linking

Basis Technology participated in the TAC Entity-Link task of the Knowledge Base Population track at TAC 2013. This paper describes the system we developed and runs submitted for English, Chinese, and Spanish evaluation. The system is an extended and improved version of the system used in TAC 2012. We focus on the novel components and error analysis.

متن کامل

WebSAIL Wikifier: English Entity Linking at TAC 2013

In this paper, we report on our participation in the English Entity Linking task at TAC 2013. We present the WebSAIL Wikifier system, an entity disambiguation system that links textual mentions to their referent entities in Wikipedia. The system uses a supervised machine learning approach and a string-matching clustering method, and scores 58.1% B+ F1 on the TAC 2013 test set.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013